Avoid double evaluation of pattern matchers in Chunk.collect #3364

flipp5b · 2023-12-20T08:13:49Z

fs2.Chunk#collect evaluates pattern matchers and guards of a passed partial function twice because it calls pf.isDefinedAt and then pf.apply:

// ...
foreach(o => if (pf.isDefinedAt(o)) b += pf(o))
// ...

Scala collections use pf.applyOrElse/pf.runWith (depending on scala version) instead to avoid double evaluation. This PR implements the same approach in a Chunk.

It should be noted that this may change observable behavior for those who use a partial function with side-effecting matchers/guards. But hey, side-effecting matchers/guards is considered a bad practice anyway, and I doubt that there's an fs2 user relying on that double evaluation "feature."

flipp5b · 2023-12-20T09:05:37Z

core/shared/src/main/scala/fs2/Chunk.scala

@@ -87,7 +87,8 @@ abstract class Chunk[+O] extends Serializable with ChunkPlatform[O] with ChunkRu
  def collect[O2](pf: PartialFunction[O, O2]): Chunk[O2] = {
    val b = makeArrayBuilder[Any]
    b.sizeHint(size)
-    foreach(o => if (pf.isDefinedAt(o)) b += pf(o))
+    val f = pf.runWith(b += _)
+    foreach { o => f(o); () }


Well, this looks rather ugly. pf.runWith returns O => Boolean so we have to transform it to O => Unit to avoid warnings. This could be rewritten as follows (at the cost of extra lambda call per Chunk item):

foreach(pf.runWith(b += _).andThen(_ => ()))

@armanbilge, which one would you prefer?

Ugly but performant is the right choice 😅

flipp5b · 2023-12-20T11:19:45Z

Here is some benchmark results.

1. Master

foreach(o => if (pf.isDefinedAt(o)) b += pf(o))

Benchmark               (chunkSize)   Mode  Cnt        Score       Error  Units
ChunkBenchmark.collect           16  thrpt   25  2828318.069 ± 72556.887  ops/s
ChunkBenchmark.collect          256  thrpt   25   191183.476 ±   581.751  ops/s
ChunkBenchmark.collect         4096  thrpt   25    11461.704 ±   504.370  ops/s

2. This PR

    val f = pf.runWith(b += _)
    foreach { o => f(o); () }

Benchmark               (chunkSize)   Mode  Cnt        Score       Error  Units
ChunkBenchmark.collect           16  thrpt   25  3373807.015 ± 25730.144  ops/s
ChunkBenchmark.collect          256  thrpt   25   257101.963 ±  7552.952  ops/s
ChunkBenchmark.collect         4096  thrpt   25    15417.548 ±   333.196  ops/s

3. This PR (prettified)

foreach(pf.runWith(b += _).andThen(_ => ()))

Benchmark               (chunkSize)   Mode  Cnt        Score       Error  Units
ChunkBenchmark.collect           16  thrpt   25  3420711.048 ± 90333.134  ops/s
ChunkBenchmark.collect          256  thrpt   25   242847.544 ±  4885.911  ops/s
ChunkBenchmark.collect         4096  thrpt   25    14369.581 ±   442.678  ops/s

Option 2 (this PR) shows slightly better score than the other two options.

armanbilge

Thank you for the very thorough PR, this is very nice work 🙏

flipp5b · 2023-12-20T20:16:13Z

@armanbilge Thank you for the prompt and kind feedback :)

flipp5b force-pushed the optimize-chunk-collect branch from 7df3227 to 1c3ed3d Compare December 20, 2023 08:39

flipp5b commented Dec 20, 2023

View reviewed changes

Avoid double evaluation of pattern matchers in Chunk.collect

4e146de

flipp5b force-pushed the optimize-chunk-collect branch from 1c3ed3d to 4e146de Compare December 20, 2023 11:25

armanbilge approved these changes Dec 20, 2023

View reviewed changes

mpilquist merged commit f5d3587 into typelevel:main Dec 23, 2023
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid double evaluation of pattern matchers in Chunk.collect #3364

Avoid double evaluation of pattern matchers in Chunk.collect #3364

flipp5b commented Dec 20, 2023

flipp5b Dec 20, 2023 •

edited

Loading

armanbilge Dec 20, 2023

flipp5b commented Dec 20, 2023 •

edited

Loading

armanbilge left a comment

flipp5b commented Dec 20, 2023

Avoid double evaluation of pattern matchers in Chunk.collect #3364

Avoid double evaluation of pattern matchers in Chunk.collect #3364

Conversation

flipp5b commented Dec 20, 2023

flipp5b Dec 20, 2023 • edited Loading

Choose a reason for hiding this comment

armanbilge Dec 20, 2023

Choose a reason for hiding this comment

flipp5b commented Dec 20, 2023 • edited Loading

1. Master

2. This PR

3. This PR (prettified)

armanbilge left a comment

Choose a reason for hiding this comment

flipp5b commented Dec 20, 2023

flipp5b Dec 20, 2023 •

edited

Loading

flipp5b commented Dec 20, 2023 •

edited

Loading